Scene Text Detection Using Attention with Depthwise Separable Convolutions

نویسندگان

چکیده

In spite of significant research efforts, the existing scene text detection methods fall short challenges and requirements posed in real-life applications. natural scenes, segments exhibit a wide range shape complexities, scale, font property variations, they appear mostly incidental. Furthermore, computational requirement detector is an important factor for real-time operation. To address aforementioned issues, paper presents novel using deep convolutional network which efficiently detects arbitrary oriented complex-shaped from scenes predicts quadrilateral bounding boxes around segments. The proposed designed U-shape architecture with careful incorporation skip connections to capture complex attributes at multiple scales. For addressing input processing, uses MobileNet model as backbone that on depthwise separable convolutions. design integrated attention blocks enhance learning ability our detector, where are based efficient channel attention. trained multi-objective formulation supported by text-aware non-maximal procedure generate final box predictions. On extensive evaluations ICDAR2013, ICDAR2015, MSRA-TD500, COCOText datasets, reports F-scores 0.910, 0.879, 0.830, 0.617, respectively.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Diagonalwise Refactorization: An Efficient Training Method for Depthwise Convolutions

Depthwise convolutions provide significant performance benefits owing to the reduction in both parameters and mult-adds. However, training depthwise convolution layers with GPUs is slow in current deep learning frameworks because their implementations cannot fully utilize the GPU capacity. To address this problem, in this paper we present an efficient method (called diagonalwise refactorization...

متن کامل

Natural scene text localization using edge color signature

Localizing text regions in images taken from natural scenes is one of the challenging problems dueto variations in font, size, color and orientation of text. In this paper, we introduce a new concept socalled Edge Color Signature for localizing text regions in an image. This method is able to localizeboth Farsi and English texts. In the proposed method rst a pyramid using diff...

متن کامل

Object Attention Patches for Text Detection and Recognition in Scene Images using SIFT

Natural urban scene images contain many problems for character recognition such as luminance noise, varying font styles or cluttered backgrounds. Detecting and recognizing text in a natural scene is a difficult problem. Several techniques have been proposed to overcome these problems. These are, however, usually based on a bottom-up scheme, which provides a lot of false positives, false negativ...

متن کامل

Reading Scene Text with Attention Convolutional Sequence Modeling

Reading text in the wild is a challenging task in the field of computer vision. Existing approaches mainly adopted Connectionist Temporal Classification (CTC) or Attention models based on Recurrent Neural Network (RNN), which is computationally expensive and hard to train. In this paper, we present an end-to-end Attention Convolutional Network for scene text recognition. Firstly, instead of RNN...

متن کامل

Text Detection in Natural Scene Images using Spatial Histograms

In this paper, we present a texture-based text detection scheme for detecting text in natural scene images. This is a preliminary work towards a complete system of text detection, localization and recognition in order to help visually impaired persons. We have employed spatial histograms computed from gray-level co-occurrence matrices for texture coding and three classifiers have been evaluated...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Applied sciences

سال: 2022

ISSN: ['2076-3417']

DOI: https://doi.org/10.3390/app12136425